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ABSTRACT 

Escherichia coli RfaH activates gene expression by 
tethering the elongating RNA polymerase to the 
ribosome. This bridging action requires a complete 
refolding of the RfaH C-terminal domain (CTD) from 
an a-helical hairpin, which binds to the N-terminal 
domain (NTD) in the free protein, to a p-barrel, which 
interacts with the ribosomal protein S10 following 
RfaH recruitment to its target operons. The CTD 
forms a p-barrel when expressed alone or proteo- 
lytically separated from the NTD, indicating that 
the a-helical state is trapped by the NTD, perhaps 
co-translationally. Alternatively, the interdomain 
contacts may be sufficient to drive the formation 
of the a-helical form. Here, we use functional and 
NMR analyses to show that the denatured RfaH 
refolds into the native state and that RfaH in which 
the order of the domains is reversed is fully func- 
tional in vitro and in vivo. Our results indicate that all 
information necessary to determine its fold is 
encoded within RfaH itself, whereas accessory 
factors or sequential folding of NTD and CTD 
during translation are dispensable. These findings 
suggest that universally conserved RfaH homologs 
may change folds to accommodate diverse inter- 
action partners and that context-dependent 
protein refolding may be widespread in nature. 

INTRODUCTION 

RfaH is an operon-specific transcription factor that 
belongs to the family of NusG proteins, the only univer- 
sally conserved transcription factors (1). RfaH is a viru- 
lence factor that activates expression of horizontally 
transferred operons in Escherichia coli and several other 
human pathogens. These operons encode conjugation 
functions, lipopolysaccharide core, O-antigen, capsules 



and cytolytic toxins such as hemolysin (2), suggesting 
that RfaH may play a role in virulence. Indeed, deletion 
of RfaH attenuates virulence in Salmonella (3), but RfaH 
effects on pathogenesis in other bacteria remain to be 
elucidated. RfaH function requires a specific DNA 
element called ops (operon polarity suppressor) in the 
transcribed leader regions of its target operons (4) and is 
mediated by simultaneous RfaH contacts with the 
transcribing RNA polymerase (RNAP) and the 
translating ribosome (5). RfaH inhibits the activity of 
Rho, which terminates transcription of horizontally 
transferred genes (6), by three mechanisms: (i) increasing 
RNAP processivity; (ii) excluding NusG, a general tran- 
scription factor that potentiates Rho-dependent termin- 
ation, from the transcription elongation complex (TEC); 
and (hi) coupling transcription and translation, thereby 
blocking Rho access to mRNA (7). Together, these mech- 
anisms bring about dramatic effects: RfaH stimulates ex- 
pression of its target operons several hundred fold (7,8). 

RfaH antitermination function depends on a dramatic 
structural transformation (4). RfaH consists of two 
domains (Figure 1), the N-terminal domain (NTD) and 
the C-terminal domain (CTD). While the RfaH NTD 
shows the structural features typical for the NTD of 
NusG proteins, the CTD is an a-helical hairpin, in 
contrast to all other known NusG proteins where the 
CTD forms a five-stranded P-barrel. Additionally, RfaH 
CTD and NTD form a tight interface in the free protein. 
This is also untypical for members of the NusG family, as 
the two domains normally show no interaction (1,10); the 
only other known example where CTD and NTD interact 
is NusG from Thermotoga maritima; however, in this case, 
the CTD is in (3-barrel conformation (11), rendering the 
closed state of RfaH unique. 

In this closed form, the a-helical CTD (CTD-a) masks 
an RNAP-binding site on the NTD (12). During RfaH 
recruitment, transient interactions between the NTD and 
the TEC paused at the ops site are thought to trigger 
domain dissociation, allowing the NTD to establish 
contacts with RNAP that persist throughout transcription 
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Figure 1. RfaH transformation. In the closed autoinhibited state, inter- 
action surfaces for S10 and RNAP are masked. On dissociation, the 
RNAP-binding site on the NTD (green) becomes exposed, whereas the 
CTD (blue) refolds from an all-ot into an all-(3 state that interacts with 
S10. The flexible linker is depicted as a gray line. Arrows indicate 
interaction partners of each domain. PDB IDs: 20UG, RfaH; 2LCL, 
CTD. The figure was created using PyMOL (9). 



(13). After losing its contacts with the NTD, the CTD spon- 
taneously and completely refolds into an SH3 (3-barrel 
(CTD-fj), which is essentially identical to the NusG CTD 
(4). While the NTD is responsible for the universally 
conserved antitermination modification of RNAP by 
RfaH (1) and its homologs, it is the CTD that confers the 
unique properties of RfaH. Each of the alternative folds of 
the CTD is essential for the proper control of gene expres- 
sion. In the a-helical state, the CTD restricts RfaH action 
to o/7^-containing operons, avoiding interference with its 
paralog, the ubiquitous NusG (12,13). In the (3-barrel 
state, the CTD interacts with the ribosomal protein S10 
to enable translation in the absence of canonical 
ribosome recruitment elements (4). The two CTD states 
possess completely distinct well-defined folds and second- 
ary structure elements, each conferring a characteristic 
regulatory property, classifying RfaH as the first member 
of the class of transformer proteins (14). 

The P-barrel appears to be a preferred state of the CTD. 
The CTD folds as a (3-barrel in the structures of all other 
known NusG homologs (1) and when expressed separately 
or when released from the RfaH NTD on proteolytic 
cleavage of the interdomain linker (3). This raises the 
question of how the CTD is induced to fold into the 
a-helical state during translation of the full-length RfaH. 
One possibility is that the CTD-a forms only when 
assisted by transient interactions between the NTD and 
the emerging CTD, potentially facilitated by chaperones 
or by ribosome pausing within the C-terminal region of 
rfaH mRNA. Examples of ribosome pausing determining 
protein folding pathways have been reported (15) and a 
trigger-factor dependent pause was detected by ribosome 
profiling in rfaH (16). Alternatively, the CTD may initially 
fold as a (3-barrel and subsequently become converted into 
the a-helical form in the presence of the NTD. This idea is 
supported by equal distribution of the two structural 
states of the CTD observed in an RfaH variant with an 



E48S substitution in the NTD designed to destabilize the 
domain interaction (5). Such an equilibrium has not been 
observed, however, in wild-type (WT) RfaH, suggesting 
that the (3-barrel is at most poorly populated in the 
absence of the TEC. 

In this work, we used two approaches to distinguish 
between these models. First, we tested whether denatured 
RfaH can spontaneously refold into the native closed 
state. Second, we tested whether an RfaH variant in 
which the order of the domains is reversed is functional. 
Our results indicate that all information necessary to 
control its fold is encoded within RfaH itself, whereas 
accessory factors or sequential folding of NTD and 
CTD during translation are not essential for folding of 
active RfaH. 

MATERIALS AND METHODS 

Production, purification and refolding of WT RfaH 

Escherichia coli BL21 XJb (1DE3) cells harboring pIA238 
(12) were grown in Lauria Broth (LB) supplemented with 
kanamycin at 37°C. Once the OD 600 reached ~0.6, cells 
were induced with 0.2 mM of Isopropyl (3-D-l-thiogalac- 
topyranoside (IPTG), grown for 4 more hours and har- 
vested (centrifugation at 6000g 4°C for 15min). For 
purification in denaturing condition, cells were suspended 
in lysis buffer [50 mM Tris-Cl (pH 7.8), 8M Urea, 
500 mM NaCl and 0.25 mM Dithiothreitol (DTT)] for 
30min at room temperature followed by mild sonication. 
The lysate was clarified by centrifugation for 30min at 
40 OOOg. The supernatant was loaded onto Ni-NTA (GE 
Healthcare, Piscataway, NJ, USA) resin preequilibrated in 
lysis buffer. Next, the column was washed with lysis buffer 
containing progressively decreasing amounts of urea. 
Finally, RfaH was eluted by a step gradient of 50- 
200 mM imidazole in 50 mM Tris-HCl (pH 7.8), 
500 mM NaCl, ImM DTT. Eluted fractions were 
analyzed by sodium dodecyl sulphate-polyacrylamide gel 
electrophoresis, the fractions containing homogenous 
protein were pooled and dialyzed against storage buffer 
(10 mM Tris-HCl, 100 mM NaCl, 50% (v/v) glycerol, 
0.1 mM DTT, 0.1 mM EDTA) and stored at -80 or 
-20°C. Purification of native unlabeled WT RfaH, the 
isolated NTD and RNAP were carried out as described 
previously (12). 

Production and purification of domain-swapped RfaH 

Plasmid pIAl 181 encodes for His6-MP I12 - 
L 162 PKDIVDPATPYPGQ2-K 10 o domain-swapped (DS) 
RfaH. Here, subscript numbers denote the positions of 
respective amino acids in the WT RfaH. Escherichia coli 
BL21 XJb (1DE3) cells harboring pIAl 181 (encoding 
His 6 -tagged DS RfaH) were grown at 37°C in LB contain- 
ing kanamycin till OD 600 reached ~0.6, induced with 
0.2mM IPTG and grown for 14-16h at 18°C. Cells 
were harvested by centrifugation at 6000g 4°C for 
15min and lysed in 50 mM Tris-HCl (pH 7.9), 50 mM 
NaCl, 5% (v/v) glycerol, 1 mM P-mercaptoethanol and 
EDTA-free protease inhibitor (Roche, Indianapolis, IN, 
USA). Lysate was clarified by centrifugation for 30min 
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at 40 OOOg and protein was purified on nickel affinity and 
Superdex 75 columns (GE Healthcare). 15 N-labeled WT 
RfaH and the isolated CTD were purified as described 
previously (5); for 15 N-labelled DS RfaH, the procedure 
of WT RfaH was used. 

NMR spectroscopy 

Spectra were recorded on Bruker Avance 700 MHz or 
Avance 800 MHz spectrometers with cryogenically 
cooled triple-resonance probes equipped with pulsed 
field-gradient capabilities. NMR data were processed 
using in-house routines and visualized using NMRView 
(17). ['H^NJ-heteronuclear single quantum correlation 
(HSQC) spectra of 15 N-WT RfaH and 15 N-DS RfaH 
were recorded at 288 K, spectra of 15 N-RfaH CTD were 
recorded at 298 K. 15 N-WT RfaH and 15 N-DS RfaH were 
in 10 mM potassium phosphate buffer (pH 7.5), 50 mM 
KC1. All samples contained 10% (v/v) D 2 0. 

In the refolding experiments of 15 N-RfaH and 
15 N-RfaH-CTD ['H, 15 N]-HSQC spectra in refolding 
buffer [25 mM Tris-HCl (pH 7.5), 150mM NaCl] were 
recorded before denaturation. Urea was then added to 
8 M over a period of 30 min on ice, followed by an incu- 
bation of further 30 min on ice. Five hundred microliters 
were then centrifuged for 10 min at 16000g at 4°C to 
remove precipitate before the sample was analyzed by 
NMR spectroscopy. Proteins were refolded by stepwise 
dialysis at room temperature against refolding buffer con- 
taining 4, 2, 1 M urea and no urea; each dialysis step took 
at least 2h. The final dialysis step was carried out over- 
night and the samples were then concentrated using 
VivaSpin devices (Molecular Weight Cut Off (MWCO) 
3kDa; EMD Millipore, Billerica, MA, USA). The 
refolded protein samples were centrifuged for 10 min at 
16 000g and 4°C before analyses by NMR spectroscopy. 

Pause assays 

Thirty nanomolar linear DNA template generated by 
polymerase chain reaction amplification, 40 nM holo 
RNAP, 100 uM ApU, starting NTP subsets (5uM each 
of ATP and GTP, 1 uM CTP and lOuCi [a 32 P]-CTP, 
3000Ci/mmol) were mixed on ice in TGA10 buffer 
(20 mM Tris-acetate, pH 7.9, 20 mM Na acetate, 10 mM 
MgCl, 5% glycerol, 0.1 mM EDTA and 2mM 
P-mercaptoethanol). Halted radiolabeled TECs were 
formed by incubation at 37°C for 15 min. These were 
further mixed with 50 nM RfaH variants for 3 min. 
Elongation was started in presence of 25 ug/ml rifapentin 
by the addition of NTPs mix containing 1 50 uM of each 
ATP, CTP and UTP and 10 uM GTP. Aliquots were 
withdrawn at 7, 15, 30, 60, 120, 240, 480, 960 and 
1200 s. Reactions were stopped by the addition of an 
equal volume of STOP buffer (10 M urea, 50 mM 
EDTA, 45mM Tris-borate, pH 8.3, 0.1% bromophenol 
blue and 0.1% xylene cyanol). Samples were heated for 
2 min at 90°C and were separated on 6% acrylamide 
(19:1), 7M urea, 0.5x Tris-borate-EDTA (TBE) gels. 
The gels were dried, and RNA products were visualized 
by Typhoon FLA 9000 scanner and quantified using 
Image Quant software (GE Healthcare). 



Reporter assays 

The lux reporter vector pIA1087 (4) was co-transformed 
with the plasmids encoding His 6 -tagged WT (pIA957) or 
DS (pIA1183) RfaH under the control of the P BA d 
promoter, or an empty vector into a ArfaH strain (18) 
and plated on media containing lOug/ml tetracycline and 
100 ug/ml carbenicillin. Single colonies were inoculated into 
2 ml LB supplemented with antibiotics and incubated with 
agitation overnight at 37°C. The cultures were diluted into 
fresh LB with antibiotics to an OD 600 of ~0.05 and grown at 
37°C for additional 6h. Background expression from Pbad 
was sufficient to generate readily measurable signals. 
Luminescence was measured in 200 ul aliquots in triplicates 
on FLUOstar OPTIMA plate reader (BMG Labtech, 
Ortenberg/Germany) and normalized by cell density. 

Western blotting 

Cells were grown under the same conditions as for the lux 
reporter assay, collected by centrifugation, resuspended in 
700 ul of 50 mM Tris-HCl, pH 7.9, 100 mM NaCl, 5% 
glycerol, 0.1 mM DTT, 0.5mg/ml lysozyme and disrupted 
by sonication. The extracts were cleared by centrifugation, 
and total protein concentrations were determined by 
Bradford assay. Sample lanes contained ~30ug protein; 
for control lanes, 40 ng of purified RfaH was used. These 
were separated on a 12% sodium dodecyl sulphate Bis 
Tris gel (Invitrogen, Carlsbad, CA, USA). The proteins 
were transferred in Tris-glycine buffer, pH 8.3, containing 
20% methanol onto Trans-Blot nitrocellulose membrane 
(Bio-Rad Laboratories, Hercules, CA, USA) at 18.2 V for 
2h in a semidry Genie blotter (Idea Scientific, 
Minneapolis, USA). The blots were blocked overnight in 
lx PBS-T, pH 7.5, 0.2% Tween 20 containing 5% nonfat 
dry milk. Following incubation with rabbit polyclonal 
antibodies diluted 1:4000 in Phosphate- Buffered Saline- 
Tween (PBS-T) for 2h at room temperature, the 
membrane was washed and probed with rabbit IgG (GE 
Healthcare) for lh (1:10000 dilution in PBS-T), washed 
again and exposed to ECL detection reagents (GE 
Healthcare). Imaging was carried out on Chemi-doc 
XRS + Molecular imager (Bio-Rad). 

RESULTS 

In vitro assay to probe the interdomain contacts in RfaH 

We first probed the folding state of RfaH using transcrip- 
tion templates that allow us to monitor the activity of the 
NTD and its interactions with the CTD in an in vitro 
system that uses only purified RNAP and RfaH 
(Figure 2). These templates encode a hairpin-dependent 
his pause (hisP) site positioned downstream of either an 
ops pause (opsP) element specifically recognized by RfaH 
(Figure 2A) or a scrambled ops site, which specifies an 
equally strong pause but does not interact with RfaH 
(Figure 2B). Once recruited to RNAP, the NTD reduces 
pausing at the hisP site. Recruitment of the closed-state 
RfaH in which the CTD-a masks the NTD requires the 
ops element to trigger domain dissociation; i.e. full-length 
RfaH is active only on ops + templates in vitro and in vivo 
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Figure 2. RfaH regains function after refolding. (A) Transcription pause assays on a linear pIA349 (4) template (shown on top) with the T7A1 
promoter, the ops element, the start site (+1), transcript end (runoff), the pause sites that occur after the addition of U43, C45 and U145 (the hisP 
pause) and the hisT terminator indicated. Halted radiolabeled G37 TECs were preincubated with RfaH (native or refolded) at 50nM or storage 
buffer for 3min at 37°C, and then challenged with rifapentin at 25 ug/ml and NTPs (10 uM GTP, 150uM ATP, CTP, UTP). Aliquots were 
withdrawn at times ranging from 7 to 1200 s and analyzed on a 6% denaturing gel. (B) Assays on a linear pIA392 template (4) lacking the ops 
site were carried out with the isolated NTD and refolded RfaH. (C and D) The fractions of RNA at the hisP from the gels in A and B, respectively, 
plotted as a function of time. The representative results from at least three independent experiments are shown; the absolute values vary within 10% 
between repeats, whereas the trends remain the same. 



(12,13). Conversely, if the CTD is absent (or is not 
properly folded and thus fails to mask the RNAP- 
binding site), the NTD can be recruited to the TEC at 
any site, and is thus able to reduce pausing on ops~ tem- 
plates (12). Thus, comparison of RfaH antipausing 
activity on ops + and ops~ templates provides an indirect 
assessment of the state of the CTD. 

RfaH spontaneously refolds into the active state 

To test whether RfaH can correctly refold after denatur- 
ation, RfaH was absorbed onto Ni-NTA resin in the 



presence of 8M urea, allowed to refold by gradually 
removing urea during the washing steps, eluted and 
dialyzed into a standard storage buffer (see 'Materials 
and Methods' section for details). Antipausing function 
of this 'refolded' protein was first tested on an ops + 
template; RfaH purified under 'native' conditions (12) 
was used as a positive control. We found that both 
native and refolded proteins affected transcription simi- 
larly, delaying RNAP escape from the opsP site and 
reducing pausing at the hisP site (Figure 2). We note 
that the refolded protein was slightly less efficient in 
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reducing pausing at the hisP site (Figure 2B and C), 
perhaps because it failed to completely refold. Because 
we used nearly equimolar concentrations of RNAP 
and RfaH (40 and 50 nM, respectively), even a small re- 
folding defect would leave a fraction of RNAP 
unmodified. 

To be able to support these functions, the compact 
NTD must establish three separate sets of contacts with 
three different parts of the TEC, namely with the 
nontemplate DNA strand, with the p gate loop and with 
the P' clamp helices (P'CH). Thus, the refolded NTD 
appears to be folded properly. Next, we used an ops~ 
template to assess the state of the CTD. As described 
above, RfaH in the closed native state is not recruited to 
RNAP transcribing this template owing to failure of 
domain dissociation, and thus will fail to reduce 
pausing. We found that the isolated NTD, which was 
used as a control, reduced pausing on the ops~ template, 
whereas refolded RfaH (Figure 2B) or native RfaH (12) 
did not. Similarly, we showed that, in contrast to WT 
RfaH, the NTD activated expression of the ops~ lux 
reporter (18). These observations are consistent with the 
formation of the closed autoinhibited state of RfaH with 
CTD-a. 

Together, our results suggest that RfaH spontaneously 
folds into the native state in which the NTD and CTD-a 
form a tight interface. However, transcriptional activity of 
RfaH serves only as an indirect reporter of the folding 
states of the NTD and CTD and does not allow one to 
ascertain that RfaH completely refolds into the native 
structure. Therefore, we visualized RfaH refolding 
directly using NMR spectroscopy (Figure 3). Initially, a 
['H, 15 N]-HSQC spectrum of native ' 5 N-labeled RfaH was 
recorded to ensure that the protein was correctly folded. 
Then RfaH was denatured in 8 M urea, which resulted in a 
['H, 1s N]-HSQC spectrum typical of an unfolded protein, 
indicating that RfaH was completely denatured 
(Figure 3A). Subsequent stepwise removal of urea by 
dialysis finally yielded a ['H, 1 NJ-HSQC spectrum nearly 
identical to the one of native RfaH with the CTD in the 
a-form. Neither before denaturation nor after refolding, 
an equilibrium of different conformations of the CTD 
could be observed. The refolding experiment was 
repeated with the isolated CTD, which is normally in 
the P-barrel conformation (Figure 3B). This protein 
domain could also be completely denatured in 8 M urea, 
and it returned to the P-state on urea removal as observed 
by NMR spectroscopy. This clearly shows that, on the one 
hand, both full-length RfaH and the isolated CTD fold 
spontaneously from a denatured state and, on the other 
hand, the CTD folds into the a-state only in the presence 
of the NTD. 

In combination with the data of the in vitro transcrip- 
tion assays, these results indicate that RfaH regains its 
structure and functionality on removal of the denaturant 
and support the assumption that the presence of 
the NTD drives the CTD into the a-state and stabilizes 
it. However, they cannot exclude a possibility that 
co-translational folding may proceed at an accelerated 
rate. 



DS RfaH folds into a closed conformation 

An alternative approach to assess the requirements for co- 
translational folding of RfaH is to reverse the order of its 
domains in the amino acid sequence. Growing evidence 
suggests that although well-defined protein domains fre- 
quently fold as soon as they emerge from the ribosome 
exit tunnel, ribosome pausing at a key position may alter a 
folding pathway (15). For example, a completely folded 
NTD could interact with the N-terminal segment of the 
CTD and direct its folding into the a-helices. This 
templating action would be abolished if the order of 
domains were reversed (Figure 4A) because the P-barrel 
would likely fold before the NTD is made. In this case, 
one would expect that the CTD folds into the P-barrel 
when it emerges from the ribosome, just as it does when 
expressed alone. Subsequent translation and folding of the 
NTD would generate an open o/«-independent form of 
RfaH (12,18) with a large nonpolar surface exposed. 
Given that the NTD cannot be overexpressed separately 
and precipitates when separated from CTD (at concentra- 
tions exceeding 2uM), the open form is expected to be 
unstable in the cell. Only if the CTD can refold, the 
domain interface could be established, shielding the 
hydrophobic cavity on the NTD and conferring the re- 
quirement for the ops element. 

We constructed an RfaH variant in which the order of 
domains is reversed, extending the linker by two residues 
relative to the WT RfaH to facilitate interdomain 
contacts. We found that this DS RfaH behaved similarly 
to the WT protein in in vitro transcription assays 
(Figure 4B and C); most importantly, it did not work on 
the template lacking ops, indicating that the RNAP- 
binding surface on the NTD is masked by the CTD that 
presumably is in a-helical form. 

To ascertain that the DS RfaH assumes a closed state 
with the CTD in the a-form, we analyzed it by NMR 
spectroscopy. The ['H, 15 N]-HSQC spectra of the DS 
RfaH and WT RfaH were similar (Figure 4D), and 
signals typical for well-structured parts of the NTD and 
CTD-a of WT RfaH could be clearly found in the 
spectrum of DS RfaH. This allowed the assignment of 
residue numbers to signals from both the NTD and the 
CTD in the DS RfaH spectrum (e.g. residues 5, 6, 29, 33, 
51, 53, 68, 69 84, 88 from the NTD and residues 121, 132, 
134, 135, 147 and 157 from the CTD). Additional signals 
may be attributed to the additionally introduced amino 
acids and rearrangements in the linker region. These 
data indicate that DS RfaH is structurally similar to WT 
RfaH with the CTD in the a-helical state. 

DS RfaH is active in vivo 

Next, we tested whether DS RfaH is functional in vivo. We 
expressed at low levels the WT or DS RfaH variant in a 
ArfaH E. coli strain from a plasmid and measured the 
activity of luciferase encoded by the luxCDABE operon 
on a compatible plasmid. We previously showed that 
RfaH dramatically stimulates lux expression from 
vectors lacking a Shine-Dalgarno element (3), consistent 
with a key role of RfaH in coupling transcription to trans- 
lation. The WT and the DS RfaH increased lux expression 
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~ 1 70- and 50-fold, respectively, as compared with the 
empty vector (Figure 5A). The reduced (to ~25%) DS 
RfaH activity could be due to defects in folding or to 
reduced steady-state protein levels, for example, if 
the 'open' RfaH is targeted for cellular degradation 
(Figure 4A). Consistent with the second scenario, 
Western blotting using polyclonal antibodies specific for 
the CTD (18) revealed that the DS RfaH was present at 
~20% level relative to the WT protein under conditions 
used for the reporter assays (Figure 5B). Our results dem- 
onstrate that the DS RfaH is fully active in the cell. 

DISCUSSION 

RfaH CTD is able to transform from an a-helical hairpin 
into a P-barrel, and each state plays its specific regulatory 
role. Here, we asked whether RfaH CTD folding into the 
less stable a-helical state requires co-translational folding 
assisted by the ribosome or a ribosome-associate factor. 
We show that in the full-length RfaH, the CTD folds into 
an a-helical structure on refolding from the denatured 
state or even when synthesized by the ribosome before 
the NTD in the DS RfaH variant. In the latter case, we 
favor a scenario in which the CTD first folds into the 
stable (3-barrel, just as it is known from the isolated 
CTD (5), and its transformation into the a-form is 
induced as soon as the NTD becomes available. We 
cannot exclude a possibility that in the DS RfaH, 
folding of the CTD is delayed until the completion of 
the entire chain; however, other SH3 domains fold on a 



time scale of 10 ms (19, 20). These data demonstrate that 
co-translational folding is dispensable for the formation 
of the native RfaH fold and that the NTD, when tethered 
to the CTD, templates its folding into the a-helices. 

Our findings that the DS RfaH regains its native fold 
suggest that the closed form of RfaH could be regenerated 
after each cycle, when ribosome and RNAP dissociate 
from the nucleic acid chains at the end of an operon 
(Figure 6). If it remained bound to RNAP, RfaH would 
inhibit initiation because it competes with the a factor for 
binding to the core enzyme (21). However, this scenario is 
unlikely because RfaH binds to the isolated RNAP weakly 
(4), and its persistent contacts with RNAP observed 
during elongation in vivo (13) are likely stabilized by a 
network of protein/DNA interactions. Likewise, RfaH 
contacts with S10 are weak in the absence of other com- 
ponents (4). 

What happens to RfaH after dissociation? ChlP-chip 
data show that RfaH is restricted to o/>.v-containing 
operons in vivo (13), indicating that the open form does 
not persist in the cell. Thus, RfaH is either rapidly 
degraded or refolded into the closed autoinhibited form. 
We favor the second scenario in which the CTD plasticity 
allows RfaH to be recycled after each use, as the results 
obtained with the DS RfaH suggest that the CTD can 
undergo a p^a switch. This conversion remains to be 
demonstrated but we note that RfaH is optimized for 
readily shifting structures — we showed that disruption of 
a single interdomain salt bridge was sufficient to achieve 
an approximately equal distribution between the two 
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Figure 4. DS RfaH is active and folded. (A) Fates of the WT and DS RfaH. (B) Pause assays on pIA349, performed as in Figure 2A. (C) The 
fractions of hisP RNA from the panel A and Supplementary Figure SI, respectively, plotted as a function of time. The representative results from at 
least three independent experiments are shown; the absolute values vary within 10% between repeats, whereas the trends remain the same. (D) 
['H^NJ-HSQC spectra of WT RfaH (black) and DS RfaH (red) were recorded at 288 K in 10 mM potassium phosphate buffer (pH 7.5) containing 
50 mM KC1; both proteins were at 200 uM. Blue and green numbers correspond to the amino acids numbers of the NTD and CTD of WT RfaH, 
respectively. 



states in solution (4), implying that the oc/p equilibrium 
may be shifted by ligands binding to either RfaH domain. 

It is possible that other protein domains may have dras- 
tically different folds not predicted by homology modeling 
or even structures of individual domains if they are 
involved in interdomain interactions, interact with other 



proteins as part of a larger complex or bind an abundant 
ligand. In particular, our results suggest a tantalizing pos- 
sibility that ubiquitous P-barrel domains, which are 
present in many proteins in addition to NusG-like 
factors, may undergo similar changes on binding to their 
partners. Our studies have broad implications for the 
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understanding of structure, dynamics and evolution of 
proteins and underscore the limitations of studies of 
isolated protein domains. 



CONCLUSIONS AND PERSPECTIVES 

The fold of a given protein is primarily dictated by its 
amino acid sequence, allowing for de novo prediction of 
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Figure 5. The DS RfaH is functional in vivo. (A) Expression of a 
reporter lux operon lacking a Shine-Dalgarno element activated by 
RfaH (WT or DS) encoded on a compatible plasmid. The results are 
an average ± standard deviation of three independent experiments. (B) 
Steady-state levels of RfaH measured by western blotting under the 
conditions identical to A. The band shown as X is an unknown 
protein that cross-reacts with polyclonal anti-RfaH antibodies. 



protein structures (22). However, examples where short 
peptides (23) and small proteins (24) adopt different sec- 
ondary and tertiary structures depending on a broader 
context have been characterized. Even more remarkably, 
some proteins, such as the chemokine lymphotactin (25) 
and Mad2 (26), can interconvert between two states with 
different topologies. In contrast to these Metamorphic 
proteins, which preserve some, if not most, elements of 
their secondary structures, the RfaH CTD undergoes a 
complete a— >■ p switch to acquire a new role in translation 
(4). Similarly, the key molecular switch in prion diseases is 
a conversion of a monomeric a-helical cellular prion into 
an oligomeric infectious prion enriched in (3-sheets, and 
one among three current models postulates a complete 
a^p prion conversion (27). Beyond contextual inter- 
actions with other segments of the same peptide chain, 
many extrinsic factors, such as ribosome pausing (15), 
protein chaperones (28) and even osmolytes (29), can 
modulate the folding pathway. In the crowded cellular 
environment, proteins fold while surrounded by many po- 
tential interactors, from small molecules to nucleic acids to 
other proteins, and scenarios where proteins adopt one of 
alternative folds on binding to a ligand appear to be 
common. 

Together, these studies suggest that context-dependent 
changes of protein structure are widespread in nature and 
play key functional roles. In extreme cases, refolding could 
generate completely new interaction interfaces, which are 
masked in isolated structures. Serpins, for example, 
undergo large rearrangements on binding to their targets 
(30), and the RfaH CTD can only bind to S10 after total 
transformation from its a-helical state into a p-barrel (4). 
The ability to change its fold would greatly expand the 
regulatory repertoire of a given protein. We demonstrated 
that in the case of the transformer protein RfaH, all in- 
formation as to which fold to adopt is encoded in the 
protein itself, rendering RfaH a paradigm for exploring 
emerging principles of protein plasticity. 
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Figure 6. Refolding of the CTD could be reversible. Before recruitment, RfaH exists in a closed state, with the CTD-a masking the RNAP-binding 
site on the NTD. Following recruitment, RfaH opens up and the NTD establishes interactions with RNAP p gate loop (GL; magenta) and p' clamp 
helices (CH; orange) elements, which mediate antitermination. The CTD is released and refolded, making contacts to S10 that are proposed to 
facilitate ribosome (or 30S) loading. Both sets of contacts are likely maintained throughout expression of the entire operon but then are sequentially 
broken when first ribosome, and then RNAP terminate synthesis. Once both domains lose contacts with their respective partners, the NTD can mold 
the CTD into the a-helix, thereby completing the cycle. 
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